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1.  Introduction 

In  an  organization,  individuals  typically  differ  in  at  least  three  important 
respects:  (1)  they  control  different  action  variables,  (2)  they  base  their  decisions 
on  different  information,  (3)  they  have  different  goals.  Thus  it  would  seem  that 
the  theory  of  games  provides  the  most  suitable  mathematical  framework  for  the 
study  of  organizations.  However,  many  interesting  aspects  of  organizations  are 
related  to  differences  of  types  (1)  and  (2)  only.  Furthermore,  in  some  cases  the 
members  of  the  organization  may  have  nearly  identical  goals;  or,  as  in  the  case 
of  organizing  machines,  it  may  be  appropriate  to  consider  only  the  goal  of  the 
organizer.  Finally,  in  its  present  state  of  development,  the  theory  of  games  of 
more  than  two  persons  does  not  appear  to  provide  many  clues  as  to  how  to 
proceed  in  a  general  analysis  of  organizations. 

All  of  this  suggests  the  study  of  theoretical  organizations  in  which  differences 
of  type  (3)  are  absent,  that  is,  in  which  there  is  a  single  payoff  function  reflecting 
the  common  goals  of  the  members,  or  of  the  organizer.  J.  Marscbak  has  called 
such  an  organization  a  team  (see  [3]).  In  the  theory  of  teams,  as  in  statistical 
decision  problems  in  general,  two  basic  questions  are:  (a)  for  a  given  structure 
of  information,  what  is  the  optimal  decision  function?  (b)  what  are  the  relative 
values  of  alternative  structures  of  information?  For  example,  consider  an 
airline  company  with  a  number  of  ticket  agents  who  are  authorized  to  sell 
reservations  on  future  flights  with  only  partial  (if  any)  information  about  what 
reservations  have  teen  booked  by  other  agents.  One  can  study  the  best  rules  for 
these  agents  to  use  under  such  circumstances,  taking  account  of  the  joint  prob¬ 
ability  distribution  of  demands  for  reservations  at  the  several  offices,  the  losses 
due  to  selling  too  many  or  too  few  reservations  in  total,  and  so  forth.  One  can 
also  study  the  additional  value  that  would  result  from  providing  the  agents 
with  complete  information  about  the  other  reservations  already  booked;  such 
an  additional  value  figure  would  place  an  upper  limit  on  the  expense  that  it  would 
be  worthwhile  to  incur  in  order  to  provide  the  agents  with  that  information. 
M.  Beckmann  in  [1]  has  analyzed  airlines  reservations  problems  along  these 
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lines;  and  C.  B.  McGuire  [4]  has  studied  certain  other  models  of  sales  organiza¬ 
tion,  also  from  a  team  theoretic  point  of  view. 

In  this  paper  I  describe,  evaluate,  and  compare  certain  elementary  informa¬ 
tion  structures  in  teams.  Some  of  these  information  structures  (for  example, 
complete  information,  complete  decentralization)  are  of  interest  because  they 
are  in  a  sense  extreme;  they  are  useful  as  bases  of  comparison  with  other  infor¬ 
mation  structures.  The  others  represent  simplified  models  that  are  suggested  by 
common  organizational  devices.  The  reader  will  have  no  difficulty  in  recognizing 
the  primitive  character  of  these  models. 

The  entire  discussion  in  this  paper  is  restricted  to  the  case  in  which  the  payoff 
to  the  team  is  a  quadratic  function  of  the  action  variables,  for  each  possible 
state  of  the  world,  that  is,  for  each  specification  of  the  values  of  the  uncontrolled 
variables  in  the  environment.  The  methods  used  here  were  developed  in  [5]. 
Some  discussion  of  the  case  of  a  team  decision  problem  with  a  concave  poly¬ 
hedral  payoff  function  can  be  found  in  [6]  and  [7].  However,  in  that  case  explicit 
formulas  for  the  values  of  particular  information  structures  appear  to  be  very 
difficult  to  obtain,  making  it  more  difficult  than  in  the  quadratic  case  to  derive 
conclusions  about  the  relative  values  of  the  several  information  structures 
described  below. 


2.  Team  decision  problems  with  quadratic  payoff  functions 

As  already  mentioned,  in  the  team  decision  problems  to  be  considered  in  this 
paper,  the  action  variables  will  be  taken  to  be  real  variables,  and  the  payoff  to 
the  team  to  be  a  quadratic  function  of  the  action  variables,  for  every  state  of  the 
world.  Thus  let  the  action  variable  of  team  member  i  be  denoted  by  a»  (real), 
i  =  1,  *  •  •  ,  N,  let  the  state  of  the  world  be  denoted  by  x,  where  x  is  an  element 
of  some  probability  measure  space  ( X ,  9C,  p),  and  let  the  payoff  to  the  team  be 

(2.1)  co(x,  a)  =  v o  +  2 a'/x(x)  —  a'Qa , 

where  a  denotes  the  (column)  vector  with  coordinates  a,-,  for  i  =  1,  •  •  •  ,  N;  and 
n  is  a  measurable  vector-valued  function  on  X,  and  Q  is  a  fixed  positive  definite 
N  X  N  matrix.  Without  loss  of  generality  one  can  take  no  =  0.  The  measure  p 
expresses  the  uncertainty  about  which  state  of  the  world  actually  obtains. 

The  information  upon  which  the  several  team  members  base  their  decisions  is 
expressed  as  follows.  For  each  i  =  1,  •  •  •  ,  N,  let  (Yiy  <yt)  be  some  measurable 
space;  and  I\  represents  the  set  of  alternative  “signals”  that  can  be  received  by 
person  i.  Also,  for  each  t,  let  rji  be  a  measurable  function  from  X  to  Yi}  called  the 
information  function  for  person  i.  The  n-tuple  rj  —  (771,  •  •  •  ,  rjN)  will  be  called 
the  information  structure  for  the  team.  The  function  ra  determines  the  signals 
that  person  i  receives  under  the  alternative  states  of  the  world.  Thus 

(2.2)  yi  =  ru(x). 

Finally,  the  actions  of  the  team  members  are  to  be  determined  according  to 
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decision  functions  at-,  where  is  a  real-valued  measurable  function  on  7„ 
that  is, 

(2.3)  o»  =  «*(?/*)• 

The  vector  a  of  decision  functions  will  be  called  the  team  decision  function . 
Given  an  information  structure  77,  one  wishes  to  choose  a  team  decision  function 
so  as  to  maximize  the  expected  payoff 

(2.4)  n(v,  a)  =  Eu{x,  <*i [771(2:)],  •  •  •  ,  aArtwOs)]}. 

Beyond  the  choice  of  best  decision  functions  for  given  information  structures, 
it  is  of  interest  to  compare  alternative  information  structures  in  terms  of  the 
maximum  expected  payoff  that  can  be  derived  from  their  use.  It  is  convenient 
to  take  as  an  origin  for  measurement  the  maximum  expected  payoff  for  the 
“null”  information  structure,  which  provides  no  information  beyond  the  knowl¬ 
edge  of  p  itself.  Thus  the  value  of  an  information  structure  77  is  defined  by 

(2.5)  V(rj)  =  max  £2(77,  a)  —  max  Eu(x,  a). 

a  a 

In  this  paper  a  number  of  information  structures,  suggested  by  various  organiza¬ 
tional  devices,  will  be  analyzed,  from  the  point  of  view  of  determining  their 
values  and  the  corresponding  optimal  decision  functions. 


3.  Characterization  of  optimal  decision  functions  and  value  of 
information  in  the  quadratic  case 

In  this  section  the  main  tools  of  analysis  for  the  following  sections  are  pre¬ 
sented.  These  results  are  given  without  proof,  the  proofs  already  having  been 
given  in  an  earlier  paper  [5]. 

The  first  result  characterizes  the  best  team  decision  function  for  a  given  infor¬ 
mation  structure  in  terms  of  a  set  of  simultaneous  “stationarity”  conditions, 
which  can  be  derived  from  a  more  general  theorem,  according  to  which  under 
certain  conditions  a  team  decision  function  is  optimal  if  and  only  if  it  cannot  be 
improved  by  changing  only  one  component  decision  function  at  a  time. 

Theorem  1.  Suppose  Erf  <  00  9  for  i  =  1,  •  •  •  ,  N;  then  foi  any  information 
structure  rj  the  unique  (a.e.)  best  team  decision  function  is  the  solution  of 

(3.1)  qacti  +  £  qijE(aj\rji)  =  E(m\rji)y  i  =  1,  •  •  •  ,  N. 

Throughout  this  paper  it  will  be  assumed  that  Erf,  <  «>,  for  i  =  1,  •  •  •  ,  N. 
As  a  corollary  to  theorem  1,  one  has  the  following  result  on  the  value  of  an 
information  structure. 

Corollary  1.  If  &  is  an  optimal  team  decision  function  with  respect  to  an  in¬ 
formation  structure  77,  then 

(3.2) 


7(77)  -  Ed'n  -  (Ed)\Eu). 
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A  second  corollary  concerns  the  expected  value  of  an  optimal  decision  func¬ 
tion. 

Corollary  2.  If  a  is  optimal  for  some  77,  then 

(3.3)  Ed  -  Q~lEn. 

In  view  of  corollaries  1  and  2  to  theorem  1,  there  is  no  essential  loss  of  gen¬ 
erality  in  assuming  En  =  0.  As  a  consequence  one  has 

(3.4)  V(rj)  =  Ed!\i , 

(3.5)  Ed  =  0. 

The  second  theorem  deals  with  the  important  special  case  in  which  the  infor¬ 
mation  variables  and  the  random  coefficients  m  are  normally  distributed. 

Theorem  2.  Suppose  that  rji,  •  •  •  ,  tin  are  vector  valued ,  and  that  771,  •  •  ■  ,  rjN  and 
H 1,  *  •  •  ,  fiN  are  jointly  normally  distributed;  then  for  the  optimal  team  decision  func¬ 
tion,  ai  is  a  linear  function  of  yi. 

4.  Generation  of  information  structures  by  processes  of  observation, 
communication,  and  computation 

The  several  information  structures  to  be  considered  in  the  following  sections 
can  all  be  viewed  as  being  generated  by  certain  processes  of  observation,  com¬ 
munication,  and  computation.  Suppose  that  there  are  N  persons  and  that  person 
i  observes  a  random  variable  f  i(x)  and  takes  action  a*.  If  there  is  no  communica¬ 
tion  among  the  persons,  then  person  i’ s  information  function  is  iu(x)  =  ft(x). 
On  the  other  hand  if  there  is  complete  communication  among  the  persons,  then 
T)i(x)  =  {(x)  =  [fi(x),  •  •  *  ,  fAr(.r)].  Alternatively,  the  latter  information  struc¬ 
ture  could  be  generated  by  all  persons  communicating  their  observations  to  a 
central  agency,  which  computes  the  best  actions,  and  communicates  them  to  the 
corresponding  persons.  Still  different  information  structures  are  generated  if 
errors  are  introduced  into  the  communications  to  or  from  the  central  agency  or 
between  team  members. 

Rarely  does  one  encounter  in  a  real  organization  the  extremes  of  no  com¬ 
munication  or  complete  communication  just  described.  Rather,  one  finds  that 
numerous  devices  are  used  to  bring  about  a  partial  exchange  of  information.  The 
usefulness  of  such  devices  is  of  course  measured  by  the  excess  of  the  additional 
value  (expected  payoff)  they  contribute,  over  the  costs  of  installing  and  operat¬ 
ing  them.  Some  simple  devices  of  this  kind  will  be  examined  in  the  following 
sections.  For  example,  if  each  person  i  disseminates  some  contraction  of  his 
own  observation,  say  r4[f,(.r)],  to  all  other  persons  in  the  team,  then  the  resulting 
information  structure  is 

(4.1)  y\i  =  (f»,  r),  i  —  1,  *  *  *  ,  A, 

where  r  =  (n,  •  •  •  ,  rN),  A  different  type  of  “partial  decentralization^  is  achieved 
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by  partitioning  the  persons  into  groups,  with  complete  communication  within 
groups,  and  no  communication  between  groups. 

A  third  type  of  “partial  decentralization”  is  suggested  by  the  phrase  “manage¬ 
ment  by  exception.”  For  example,  suppose  that  the  possible  values  of  person  i’s 
observation  are  partitioned  into  two  subsets,  Ri  and  72,-,  labeled  “exceptional” 
and  “ordinary,”  respectively.  Suppose  further  that  whenever  person  i’s  observa¬ 
tion  is  “ordinary”  he  bases  his  action  upon  that  observation  alone,  whereas 
whenever  his  observation  is  “exceptional”  he  reports  it  to  a  central  agency,  or 
manager,  who  then  decides  the  values  of  all  action  variables  corresponding  to 
exceptional  observations,  on  the  basis  of  all  those  exceptional  observations.  The 
information  thus  generated  is,  for  each  i, 


(4.2) 


fftOc),  if  fi(x)  G  %i, 

if  f»0*0  £  Rit 


and  might  be  called  “reports  of  exceptions.” 

In  certain  of  the  information  structures  investigated  in  this  paper  it  is  assumed 
that  the  observational  functions  fi,  •  •  ■  ,  fv  are  statistically  independent.  This 
does  not  mean  that  the  information  functions  rji  for  the  several  team  members 
are  statistically  independent;  on  the  contrary,  such  dependence  is  introduced 
when  communication  takes  place.  It  would  also  be  of  interest,  of  course,  to 
study  the  effect  of  dependence  among  the  observations  themselves.  However,  as 
the  reader  will  soon  see,  the  picture  is  complicated  enough  with  independent 
observations,  and  it  has  seemed  best  at  this  time  to  leave  the  study  of  dependent 
observations  for  certain  structures  to  a  separate  investigation. 

A  special  case  of  interest  is  the  one  in  which 


(4.3) 


f»(x)  —  /i«(x), 


where  is  the  coefficient  of  a{  in  the  quadratic  payoff  function  (2.1).  This 
case  will  be  called  the  case  of  cospecialization  of  action  and  observation  since  in 
this  case  each  person  observes,  in  a  sense,  the  first  order  effect  of  his  own  action 
variable  upon  the  team  payoff.  This  concept  was  introduced  by  Marschak. 

In  order  to  see  more  clearly  the  effects  of  interactions  between  action  vari¬ 
ables  in  the  payoff  function  (as  measured  by  the  coefficients  for  i  ^  j),  it  will 
from  time  to  time  be  of  interest  to  consider  the  special  case  in  which 

(4.4)  qa  =  q\l2q)i\  *'  ^  3- 

By  suitable  changes  in  units  of  the  action  variables,  this  can  be  transformed 
into  the  case 


(4.5) 


1.  i  =  3, 

q,  i  7*  3- 


This  will  be  called  the  case  of  identical  interaction.  It  is  noteworthy  that,  in 
order  for  the  matrix  ((g,,))  of  (4.5)  to  be  positive  definite,  it  is  necessary  and 
sufficient  that 
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(4.6) 


1 


N  -  1 


<  Q  <  1, 


which  in  this  case  is  equivalent  to 


(4.7) 


-1 


d2o) 

ddiddj 


< 


_1 _ 

n  -i 


5.  Complete  communication,  complete  information,  and  routine 

Complete  communication  among  the  team  members  results  in  providing  all 
team  members  with  the  same  information  on  which  to  base  their  decisions. 
Should  this  resulting  common  information  be  sufficient  to  determine  the  best 
possible  decision  function  (that  is,  that  decision  function  that  would  be  optimal 
if  the  team  had  complete  information  about  the  state  of  the  world),  then  one 
is  in  the  special  case  of  complete  information.  At  the  other  extreme  is  the  case 
in  which  the  team  members  base  their  decisions  upon  the  knowledge  of  the 
probability  distribution  of  the  states  of  the  world  only,  which  corresponds  to  no 
observation  at  all.  This  will  be  called  the  case  of  “routine.” 

These  three  special  cases  are  typically  too  extreme  to  be  of  practical  interest 
in  an  organization  of  any  complexity.  Nevertheless,  they  are  useful  as  “base 
lines”  from  which  one  can  measure  the  effects  of  other  information  structures. 
Thus,  in  equation  (2.5)  the  value  of  an  information  structure  rj  has  beeft  defined 
as  the  maximum  expected  payoff  using  rj ,  minus  the  maximum  expected  payoff 
using  the  “routine”  information  structure.  From  the  other  side,  it  is  of  interest 
to  calculate  the  loss  due  to  using  77  as  compared  with  complete  communication, 
or  complete  information. 

In  the  special  case  of  cospecialization  of  action  and  observation  (see  section  4), 
complete  communication  is  equivalent  to  complete  information,  as  will  be  shown 
below. 

Denoting  the  observation  of  person  i  by  £%(x),  as  in  the  previous  section,  the 
information  structure  called  complete  communication  is  defined  by 

(5.1)  Vi(x)  =  f(x), 
where 

(5.2)  t(x)  s  free),  •  •  •  ,  for(*)]. 

As  is  well  known  for  the  case  of  the  quadratic  payoff  function  (2.1),  the  best 
decision  function  is  a  linear  transformation  of  the  conditional  expectation  of  /x 
given  f.  This  is  easily  seen  by  applying  theorem  1,  whose  condition  (3.1)  reduces 
in  this  case  to 

(5.3)  £  qijotj  =  0*10,  i  =  1,  •  ■  •  ,  N, 

or  more  concisely 

(5.4) 


Qa  =  E(n  |f). 
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The  optimal  team  decision  function  under  complete  communication  is  therefore 

(5.5)  d  =  Q-Wf). 

In  the  special  case  in  which  £(x)  =  xy  the  team  has  complete  information , 
that  is, 

(5.6)  rji(x)  =  x}  i  =  1,  •  •  •  ,  N. 

For  this  case  (5.5)  implies  that  the  best  team  decision  function  is 

(5.7)  p  =  Q-V. 

Henceforth  the  symbol  /3  will  denote  the  best  decision  function  under  complete 
information  as  given  by  (5.7).  Note  that  &(x)  depends  upon  x  through  n  only; 
hence  complete  knowledge  of  \i  is  sufficient  to  allow  each  team  member  to  use  0. 
From  this  it  follows  that  in  the  case  of  cospecialization  of  observation  and  in¬ 
formation,  complete  communication  is  equivalent  to  complete  information. 
Routine  is  defined  by 

(5.8)  rji(x)  =  constant  (independent  of  x),  i  =  1,  •  •  •  ,  N. 

Under  routine  any  team  decision  function  is  a  constant  vector,  say  a,  and  the 
best  value  of  this  vector  is 

(5.9)  &  =  QrlE{p)y 

as  is  easily  seen  by  applying  theorem  1.  Recall,  however,  the  normalizing  as¬ 
sumption  E(i u)  =  0,  which  with  (5.9)  implies 

(5.10)  &  =  0. 

It  follows  immediately  from  (5.10)  that  the  maximum  expected  value  under 
routine  is  zero.  Thus  by  the  normalization  of  E(p)  =  0,  the  value  of  any  informa¬ 
tion  structure  [equation  (2.3)]  and  maximum  expected  payoff  under  that  struc¬ 
ture  become  identical. 

The  value  of  complete  information  is  easily  inferred  from  (3.4)  to  be 

(5.11)  Vi  =  Ep'Q-lp. 

Also  applying  (3.4),  the  value  of  complete  communication  is 

(5.12)  =  E[E(n\t)'QrlE(p\t)]. 

The  loss  due  to  using  complete  communication,  relative  to  complete  informa¬ 
tion,  is  obtained  by  subtracting  (5.12)  from  (5.11),  which  yields 

(5.13)  Lk  =  E{[n  -  EQAfiYQ-'b  ~  «)]}• 

Considering  now  the  special  case  of  identical  interaction  (4.5),  the  inverse  of  Qf 
which  will  be  denoted  by  ((< qif )),  is  given  by 


(5.14) 


i  9*  3, 
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where 

(5.15)  D  =  (\-q)[l  +  [N  —  l)g]. 

From  (5.11),  (5.14),  and  (5.15)  one  can  compute  the  value  of  complete  informa¬ 
tion  in  this  special  case,  obtaining 


[1  +  (A  -  2)g] 


5Z  sii  51  8ij} 

i  =  i  tvy 


where 

(5.17) 

This  can  be  rewritten  as 


»</  = 


1’.  =  [(**  "  s*-) 


((/Sjv  —  5y) 


[1  +  C(Ar 


5y)  I 

-ml 


where 


Sat  —  A  ®««» 


s-v  _  #(jv  - 1)  <5 s,> 


Note  that  %  is  the  average  variance  of  nh  •  *  ■  ,  ma;  and  sn  is  the  average  co- 
variance  of  different  m  and  My- 
Hence  if 


(5.20) 


exist,  then 


s  =  lim  sN,  s  =  lim  sn, 

N-+° o  N-*  oo 


(5.21)  lim  (Vi/N)  =  ° - z 

Ar->«  1  “  g 

Furthermore,  in  the  special  case  in  which  %  and  5a  are  constant  (with  respect 
to  N),  the  approach  to  the  limit  in  (5.21)  is  either  monotonically  increasing  or 
monotonically  decreasing  according  as  qs  is  greater  than  or  less  than  5. 

In  other  words,  in  the  special  case  considered,  returns  to  scale  for  complete 
information  approach  a  constant  as  the  size  of  the  team  gets  large,  and  during 
this  approach  returns  to  scale  are  increasing  or  decreasing  according  as  os  is 
greater  than  or  less  than  5. 

For  the  case  in  which  the  Mi  are  uncorrelated  (5.18)  reduces  to 

(z  99)  v  _  AMI  +  (N  -  m 

(5— }  1  -  (1  -  q)[l  +  (N  -  m 

=  Sxf(N,  q), 


where 


(5.23) 


f(N,  q ) 


ATI  +  (N  -  2)<yl 
[1  -  ?][1  +  (N  -  1)?] 


These  last  formulas  will  appear  useful  in  later  sections. 
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6.  No  communication,  and  a  case  of  complete  informational 
decentralization 

In  the  absence  of  communication,  the  information  of  team  member  i  is 

(6.1)  Vi  =  ti, 

where  f*  is  his  own  observation.  Without  further  specification  of  the  ft-  it  does 
not  appear  that  anything  interesting  beyond  theorem  1  can  be  said  about  the 
solution.  Two  specializations  will  be  considered  here:  first,  the  case  of  statis¬ 
tically  independent  observations;  and  second,  the  case  of  cospecialization  of  ob¬ 
servation  and  action. 

In  the  case  of  independent  observations,  it  will  be  shown  that  the  value  of  the 
information  structure  is  the  sum  of  the  values  that  the  components  Vi  would 
have  in  “one-person”  problems  with  payoff  functions 

(6.2)  2ni(x)ai  -  qual 

Specifically,  I  will  show  that  the  value  of  such  an  information  structure  is 

(6.3)  F,=  £  f  i'W.lf;)*]. 

1  =  1  q  a 

Before  turning  to  the  demonstration  of  (6.3),  consider  the  effect  of  adding 
the  assumption  of  cospecialization  (f»  =  m).  In  this  case  (6.3)  becomes 


(6.4)  F2  =  Zf’ 

i  yit 

where,  as  before,  $u  =  Ent .  This  will  be  called  the  case  of  complete  informational 
decentralization ,  that  is, 


(6.5) 


Vt  =  f»  (no  communication), 

^ft  =  m*  (cospecialization), 

Mi,  •  •  •  ,  hn  independent. 


i  =  1,  •  •  •  ,  N; 
i  =  1,  ■  *  *  ,  N; 


The  values  of  the  coefficients  qa  are  not,  of  course,  invariant  under  a  change  of 
units  in  which  the  variables  a»  are  measured;  by  appropriate  changes  of  units, 
together  with  corresponding  changes  of  the  coefficients  m  and  corresponding 
changes  in  their  variances  and  covariances  one  can  achieve 

(6.6)  qu  =  1,  i  =  1,  *  *  *  , 

and  also 

(6.7)  V2  -  L  *«■ 

i 


Hence,  for  constant  sN  [see  (5.19)]  the  value  of  the  information  structure  (6.5) 
is  simply  proportional  to  N,  that  is,  complete  informational  decentralization  ex¬ 
hibits  constant  returns  to  scale. 

To  demonstrate  (6.3),  first  note  that  if  vi,  •  ■  *  ,  Vn  are  statistically  independent, 
then  for  any  team  decision  function  a, 
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(6.8)  E(aj\rn)  =  Ectj,  i  ^  j. 

In  other  words,  person  i’ s  information  does  not  help  him  to  predict  person  j’s 
action.  By  (3.5),  then,  any  optimal  team  decision  function  a  satisfies 

(6.9)  E(<Xj\vt)  =0,  iV  j. 

Applying  this  to  condition  (3.1)  of  theorem  1  gives 

(6.10)  g„a.  =  E(m\rn),  i  =  1,  •  •  •  ,  N, 


(6.11) 


i  =  1,  •  •  •  ,  N, 


for  the  optimal  team  decision  rule.  Equation  (6.3)  now  follows  easily  using  (3.4) 
and  of  course  (6.1). 

Even  without  the  assumption  of  independence  of  observations,  further  infor¬ 
mation  about  the  solution  in  the  case  of  cospecialization  can  be  obtained  under 
the  further  assumption  of  normality  of  hi,  •  •  •  ,  hn.  By  theorem  2,  components 
of  the  optimal  team  decision  rule  are  each  linear,  that  is,  for  some  constants 

bi,  •  •  *  ,  6j\r, 

(6.12)  oti  =  bifiij  i  =  1,  •  ■  •  ,  N. 

Hence,  again  using  the  normality, 


E(otj\rn)  =  bj  m. 


Applying  (6.12)  and  (6.13)  to  (3.1)  of  theorem  1, 


QubiHi  +  E  Qijbj  (t2  J  Hi  =  Mi, 
\*ii  / 


Since  (6.14)  must  hold  for  (almost)  all  values  of  hu 


(6.15) 

which  can  be  rewritten 

(6.16) 


Qabi  +  L  Qubj  (r2)  =  1> 

\Sii/ 


E  QijSijbj  —  S{ 


i  - 1, 


i  =  1,  •  *  •  ,  N, 


1 


Letting  H  =  s  =  the  vector  with  coordinates  sn,  •  •  •  ,  sNN ,  and  b  =  the 

vector  with  coordinates  &!,••*,  bN,  the  solution  of  (6.16)  can  be  expressed  as 

(6.17)  b  =  H-'s. 

Note  that  since  ((<7*/))  is  positive  definite  and  ((s<y))  is  nonnegative  semidefinite, 
H  is  positive  definite.  (H  is  the  so-called  Hadamard  product  of  ((</»;))  and  (($»/)) ; 
see  Halmos  [2],  section  69.) 

To  get  the  value  of  the  information  structure  in  this  case,  applying  (3.4)  gives 

(6.18)  F2  =  E  E  «»Mi 


=  E  E  biH2t  =  b's. 
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By  (6.17)  this  last  gives  a  value  of 

(6.19)  V2  =  s’H-'s. 


The  following  special  case  is  of  interest.  Suppose 


(6.20) 


(6.21) 


»-{ 


if  i  —  j, 

if  i  j*j; 
if  i  =  j, 
if  i^j; 


where  —  (1/N  —  1)  <  q  <  1  and  —  (1/iV  -  1)  g  r  S  1.  Then  the  value  as 
given  by  (6.19)  reduces  to 

N 


(6.22) 


y2  = 


1  +  (N  —  l)qr 


If  qr  9*  0,  then  V2  approaches  (1/gr)  as  a  limit  as  N  gets  large.  On  the  other 
hand,  if  qr  =  0,  then  V2  =  N.  In  other  words,  in  this  special  case  of  “identical 
interaction”  and  “identical  correlation”  with  cospecialization  of  action  and  ob¬ 
servation,  the  value  of  no  communication  approaches  a  ( finite )  limit  as  the  number 
of  variables  N  increases  without  limit,  if  neither  the  interaction  nor  the  correlation 
is  zero. 


7.  Partitioned  communication 

The  results  for  no  communication,  with  independent  observations,  extend 
easily  to  the  case  in  which  the  team  members  are  partitioned  into  a  set  of 
groups  h,  such  that  complete  communication  takes  place  within  each  group,  but 
no  communication  takes  place  between  groups.  Thus  let 

(7.1)  r*  - 

then  the  information  structure  under  discussion  is  defined  by 

(7.2)  vi(x)  =  f*  if  i  G  Ik . 

The  results  of  this  section  might  be  thought  of  as  describing  certain  types  of 
partial  informational  decentralization. 

Denoting  by  ak  and  nk  the  vectors  consisting  of  those  components  of  a  and  n, 
respectively,  corresponding  to  the  fcth  group,  and  by  Qk  the  corresponding  sub¬ 
matrix  of  Q,  then  by  reasoning  similar  to  that  of  section  6  the  reader  can  verify 
easily  that  the  best  team  decision  function  is 

(7.3)  a*  =  QllE{uk  |f*), 
and  that  the  value  of  this  information  structure  is 

(7.4)  Vz  =  Z  E[E(S\n'QllE(uk\n] 

k 

(with  fi ,  -  *  •  ,  fjv  assumed  independent).  Actually,  for  this  result  it  is  sufficient 
that  the  f*  be  independent. 
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In  the  case  of  cospecialization  of  action  and  observation  (4.3) ,  the  information 
structure  (7.2)  reduces  to 

(7.5)  if  i  G  h, 
and  yields  a  value,  by  (7.4),  of 

(7.6)  Vz  =  L  E  q'hu, 

k  i  (£h 

where 

(7.7)  Qt'  -  m) 

(recall  ^i,  *  ■  •  ,  m  are  uncorrelated). 

In  the  special  case  of  identical  interaction  (4.5),  the  value  (7.6)  reduces  to 
U.8)  Vz  =  Z  Skf(Mk,  q), 

k 


where  Mk  is  the  number  of  persons  in  group  fc, 


(7.9) 


8k  -  71 /f  S 

Mki&k 


and  f(Mk}  q)  is  given  by  (5.23)  [apply  (5.14)].  In  particular,  if  all  groups  are  of 
equal  size  M,  then  the  value  is 


(7.10) 


Vi  =  s{^)fiM’ q)> 


where  s  =  (1  /N)  E  [compare  with  the  value  of  complete  information  in 
equation  (5.21)]. 

Figure  1  shows  Vz  as  a  function  of  M ,  for  s  =  1,  N  =  100,  and  three  dif¬ 
ferent  values  of  q. 

On  the  other  hand,  if  a  group  (say  the  first)  has  M  members,  and  each  of  the 
rest  has  only  one  member,  then  the  value  is 

(7.11)  Vz  =  §i f(M,q)  +  E  8u. 

i(£Ii 


8.  Dissemination  of  independent  information 

As  noted  in  the  last  section,  partitioning  of  persons  (or  action  variables)  is 
one  way  of  moving  away  from  complete  informational  decentralization  towards 
identical  information.  Another  way  is  provided  by  the  system  that  will  be  called 
here  dissemination  of  information .  Specifically,  consider  a  situation  in  which  each 
team  member  communicates  some  function  of  his  observations,  that  is,  some 
statistic  to  a  “central  agency, which  then  compiles  (but  does  not  “process”) 
all  these  reports  and  distributes  this  compilation  to  all  the  members. 

I  will  show  that  the  value  of  such  an  information  structure  can  be  expressed 
exactly  as  a  sum  of  two  parts,  one  part  attributable  to  the  disseminated  infor¬ 
mation,  and  one  part  attributable  to  the  undisseminated  information. 

To  define  the  information  structure  precisely,  for  each  i  suppose  the  observa¬ 
tion  function  $*»  takes  values  in  some  set  Zt,  and  let  n  be  a  function  on  Zt*.  The 
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variable  ti  =  Ti(zi )  is  to  be  interpreted  as  the  ith  member’s  report  to  the  central 
agent.  Let  t(x)  =  [ti(x),  •  •  •  ,  tat(^)]  ;  then  define  the  information  structure  by 

(8.1)  iu(x)  =  [f,(z),  t{x)],  i  =  1,  •  •  •  ,  N. 

The  variable  t  =  (th  •  •  •  ,  tN)  is  to  be  interpreted  as  the  compilation  sent  out 
by  the  central  agent  to  all  the  team  members.  I  consider  here  only  the  case  in 
which  the  observations  Zi  are  statistically  independent.  I  also  omit  the  possibil¬ 
ity  that  the  central  agent  further  reduces  the  compilation  r( x)  to  some  summary 
statistic  before  sending  it  out  to  the  team  members.  (The  “central  agent”  here 
does  not  himself  directly  control  any  action  variable.) 

Define  pi  and  pi  by 

,g  „  Myi)  =  E(ni\y>), 

m  =  E(nM). 

I  will  show  that  the  optimal  decision  functions  are 

(8.3)  —  A<(<)], 

a 

where,  as  in  section  5,  ft  is  the  best  team  decision  function  under  complete  in¬ 
formation,  and  is  given  by  (5.7)  as  0  =  Q"V- 
The  corresponding  expected  payoff  will  be  shown  to  be 

(8.4)  V4  =  Ep'Q-'fi  +  Er  ~  EP2<)> 

i  Qii 

which  can  be  shown  to  be  equivalent  to 

(8.5)  f4  =  Ep.'Qr'a  +  E  —  E\V ar  (ftlOJ. 

i  Qii 

Note  that  the  first  term  of  (8.5)  is  the  maximum  expected  payoff  that  could  be 
obtained  if  all  team  members  had  only  the  information  function  t;  whereas  the 
second  term  is  a  weighted  sum  of  terms,  each  of  which  measures  the  degree  to 
which  that  person  can  predict  his  m  better  on  the  basis  of  yi  than  on  the  basis 
of  t  alone. 

Again,  before  demonstrating  these  facts,  we  will  consider  a  special  case.  Sup¬ 
pose  (in  addition  to  the  assumptions  already  made)  that  the  /n  are  independent, 
and  that  each  /xt  is  independent  of  {&},*».  (This  would  be  the  case  if,  for  exam¬ 
ple,  each  person’s  observation  Zi  consisted  of  an  estimate  of  m(x),  both  in  and 
the  error  being  independent  of  the  /q  and  errors  of  the  other  persons.)  It  is 
shown  below'  that  in  this  case 

(8.6)  Pi  =  EQu  If.-), 
and  that  the  value  of  the  information  structure  is 

(8.7)  F4  =  £  qliEfi  +  E  -  (Eu2t  -  Eft), 

i  t  Qti 


where,  as  before,  ((giy))  =  Q"1. 
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Again  [as  in  (8.4)]  the  first  sum  in  (8.7)  is  the  maximum  expected  payoff  that 
could  be  obtained  if  all  team  members  had  only  the  shared  information  t;  whereas 
the  second  term  measures  the  additional  value  of  each  individual's  knowing  the 
part  of  his  own  observation  that  he  did  not  share. 

Another  interpretation  of  (8.7)  is  suggested  by  rearranging  the  terms  to  give 

(8.8)  V,  =  £  ^  +  Z  («"  -  *  )  Efil 

i  Q  it  x  \  Qii/ 

The  first  sum  in  (8.8)  is  what  the  maximum  expected  payoff  would  be  if  the  ith 
member  knew  only  f »  (see  section  6  on  no  communication) ;  the  second  sum  is 
the  additional  value  attributable  to  the  dissemination  of  n,  •  •  •  ,  tn . 

Turn  now  to  the  derivation  of  the  optimal  team  decision  functions  and 
expected  payoffs.  I  shall  use  the  following  lemma,  the  proof  of  which  is  given 
in  [5]. 

Lemma.  Let  A,  C,  and  G  be  independent  random  variables  {not  necessarily 
real);  let  B  be  a  contraction  of  A,  and  D  a  contraction  of  C;  and  let  F  be  a  real 
random  variable  defined  by  F  =  /(A,  D,  G),  where  f  is  some  given  measurable 
function;  then 

(8.9)  E{F\B,  C,  (?)  =  E(F\B ,  D ,  <?). 

In  the  present  situation,  the  above  lemma  applies  to  give 

(8.10)  E{cij\yt)  =  E{cl,\t)  if  i  ^  j. 

This  can  be  seen  by  taking  (in  the  notation  of  the  lemma) 


A  =  fy, 

D  =  Tif 

(8.11) 

B  =  Tj, 

-se 

II 

c  =  f„ 

/  =  <Xj. 

From  (8.10)  it  follows  that  condition  (3.1)  of  theorem  1  reduces  in  this  case  to 

(8.12)  quai  +  £  QijE{aj\r )  =  jXi}  i  =  1,  •  •  •  ,  N. 

j 

Applying  the  lemma  again  to  pi,  the  conditional  expectation  of  (8.12),  given  r,  is 

(8.13)  Z  QtjE(ajW)  =  A,-,  i  =  1,  •  •  •  .  N. 

j 

Subtract  (8.13)  from  (8.12), 

tfi.Oi  -  E(ch\t)]  =  pi  -  Pi; 

(8.14) 

c*i  =  E(ai\r)  +  —  (pi  —  p,). 

y  i  i 

On  the  other  hand,  solving  (8.13)  for  E(a\r)  gives 

(8.15)  E(a  |r)  =  Q-lE(»\r)  =  Etf  |r). 

Substitution  of  this  into  (8.14)  gives  the  best  team  decision  function 
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(8.16)  Attot)  =  E(m  +  -f-  -  Mtil 

<£ii 

from  which  the  values  as  given  by  (8.4)  and  (8.5)  easily  follow. 

Equation  (8.6)  follows  directly  from  the  lemma,  under  the  assumptions  of  the 
special  case,  by  taking 

(8.17)  A  =  (m;,  ft),  B  =  C  =  {ry},v t,  D  constant,  /  =  m. 


9.  Error  in  instruction 


Consider  a  team  with  complete  communication  to  a  central  agent,  in  which 
the  best  team  decision  0,(x)  is  computed  by  the  central  agent,  and  each  team 
member  is  sent  a  message  instructing  him  about  the  appropriate  action 
Suppose,  however,  that  the  actual  message  received  by  member  i  is  not  the  cor¬ 
rect  value  Pi(x),  but  a  value  equal  to  the  correct  value  plus  some  random  error. 
To  be  precise,  suppose  that  the  information  to  member  i  is  given  by 

(9.1)  yi  s  m(x)  =  pi(x)  +  €,(x), 

where  pi(x)  is  the  best  decision  function  for  i  under  complete  information  (sec¬ 
tion  5),  and  u{x)  is  an  error  term. 

Each  team  member  can,  of  course,  simply  follow  the  ‘ ‘instruction”  t/*  with  the 
error,  as  he  receives  it.  Indeed,  this  might  at  first  appear  to  be  the  correct 
procedure  if  u  is  independent  of  /3t,  and  has  mean  zero.  However,  we  shall  show 
that  the  team  can  do  better  if  each  team  member  is  provided  with  a  decision 
rule  that  adjusts  the  received  instruction  in  a  suitable  way.  It  will  be  shown  that 
the  proper  adjustment  for  any  one  person  depends  in  general  upon  all  the  inter¬ 
actions  qij ,  and  that  even  if  only  some  of  the  team  members’  instructions  are 
erroneous,  all  team  members  should  typically  make  some  adjustments. 

Throughout  this  section  I  will  assume  that  (a)  p(x)  and  e(x)  are  normally 
distributed,  (b)  0  and  e  are  independent  of  each  other,  (c)  the  components  €*•  are 
(mutually)  independent.  There  is  no  loss  of  generality  in  further  assuming  that 
(d)  Efi  =  Ee  =  0. 

I  first  give  the  results  for  this  information  structure,  including  those  for  certain 
special  cases,  deferring  the  proofs  to  the  end  of  the  section. 

First,  denote  the  relevant  variances  and  covariances  by 


(9.2)  rn  =  Efifih 

(9.3)  t\  =  Eel 


Further,  define  the  numbers  /l;,  vi}  and/ij’  by 

fqu(ru  +  tl),  if 


(9.4) 


h  = 


''Qijl'ijt 


Vi  =  q«ti, 


i  =  j, 

*  ^  j, 


«/”))  =  (c 


(9.5) 

(9.6) 
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(9.7)  h  =  1  —  Zf%. 

i 

I  will  show  that  the  best  team  decision  function  is 
(9-8)  ai(yi)  =  biyi 

and  that  the  resulting  value  of  this  information  structure  is 
(9.9)  F6  -  Zfijbibj. 

ij 

Thus  each  adjustment  factor  bu  depends  upon  all  the  parameters  g*y,  r*y,  and  t\. 

Note  that  if  all  the  error  variances  are  very  small,  then  the  best  decision  b$ji 
is  very  close  to  yi  (set  all  t\  =  0).  On  the  other  hand,  if  the  error  variances  are 
very  large,  compared  to  the  variances  rtt  of  the  piy  then  6t  will  be  close  to  zero. 

It  is  also  interesting  that  even  if,  for  some  particular  i,  the  error  variance  t < 
is  zero,  the  adjustment  factor  bi  will  in  general  be  different  from  1.  In  other  words, 
error  in  the  instruction  to  some  team  members  should  cause  other  team  members 
to  adjust  their  actions  accordingly  even  if  the  former  are  receiving  error-free  in¬ 
structions. 

A  special  case.  Before  demonstrating  these  results,  consider  the  special  case 
in  which  all  interactions  are  identical,  all  correlations  between  different  Pi  and  pj 
are  identical,  and  all  error  variances  are  identical,  that  is, 


/!, 

i  =  j, 

Qij 

W. 

i  *  j, 

n, 

i  =  j, 

Tii  = 

ic, 

i  ^  j, 

t2i  = 

t2. 

Having  taken  ru  =  1,  the  parameter  t  is  to  be  interpreted  as  the  ratio  of  the 
error  variance  to  the  (common)  variance  of  the  Pi. 

In  this  case  the  adjustment  factor  bi  [see  (9.7)]  reduces  to 


(9.11) 


bi=  1  - 


_ t 2 

1  +  t2  +  (N  -  1  )qc 


and  the  value  of  the  information  structure  is 


(9.12) 

Thus  the  term 


_  N [1  +  [N  -  1  )qcl* 
5  1  +t2  +  (N  -  1  )qc 


(9,13)  1  +  t2  +  (N  -  1  )qc 

is  the  “correction”  subtracted  by  person  i  from  the  instruction  yi  that  he 
receives.  Here  it  is  quite  easy  to  see  that  if  there  are  no  errors  ( t 2  =  0),  then  the 
correction  is  zero ;  whereas  as  t2  gets  large,  the  correction  tends  to  cancel  out  the 
information  completely,  that  is,  t2/[  1  +  t2  +  (N  —  1  )qc]  tends  to  1. 
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Similar  remarks  apply  to  the  value  V*>  [equation  (9.12)].  When  t2  =  0  one  gets 
the  value  of  complete  information 

(9.14)  N[l  +  (N-  l)qc]; 

but  when  t2  gets  large,  the  value  Vs  approaches  zero. 

Proof  of  results.  By  theorem  2  the  components  of  the  optimal  team  deci¬ 
sion  function  are  linear,  say 

(9.15)  cLi(yi)  =  biVi. 

Hence  condition  (3.1)  is 

(9.16)  qubiVi  +  £  qiJbjE(yj\yl)  =  E{yi\yx)i  i  =  1,  •  •  ,  N. 

j^i 

From  assumptions  (a),  (b),  and  (c)  it  follows  that 


(9.17) 


E(vj\yi)  = 


Zi&±  , 
ru  +  t\ 


E(fr\yi)  = 


mm 

ru  +  i\ 


i  **  r, 

all  i  and  j. 


The  function  p  is  related  to  by 

(9.18)  m(z)  =  £  qaPAx), 


since  P  is  the  optimal  team  decision  function  under  complete  information  [condi¬ 
tion  (5.7)].  Substituting  (9.17)  and  (9.18)  in  the  stationarity  condition  (9.16) 
gives 


(9.19) 


qabi  ~f"  £  qijbj 
j^i 


which  reduces  to 


(9.20)  quiru  +  t\)(b<  -  1)  +  L  q^bj  -  1)  =  -qj\. 

jf^i 


Solution  of  this  system  for  the  values  (6,  —  1)  gives  (9.7),  which  completes  the 
derivation  of  the  best  team  decision  function.  The  value,  equation  (9.9),  is  ob¬ 
tained,  with  some  straightforward  algebra,  by  substituting  the  decision  func¬ 
tion  of  (9.7)  and  (9.8)  in  the  payoff  function  and  taking  the  expected  value. 

To  derive  the  results  for  the  special  case  (9.10)  I  use  the  fact  that  the  inverse 
of  an  N  X  N  matrix  ((w„))  of  the  form 


(9.21) 

is 


i  =  3, 

i  ^  j, 


m*'  = 


u+[N  —  2]w 
D 


i  =  j, 


(9.22) 


i  ^  j, 
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where 

(9.23)  D  =  (u  —  w)[u  +  (N  —  l)w] 

[compare  with  (5.14)  and  (5.15)]. 

The  matrix  ((/„•))  of  (9.4)  is  of  this  form  (9.21),  under  the  assumptions  (9.10); 
hence  the  inverse  is 


(9.24) 


+  t  +  {N  — 
D 


=32. 

_  D 


i  =  h 
i  ^  h 


D  =  (1  +  t2  —  qc)[  1  +  t2  +  (N  -  1  )qc]. 


Simple  algebra  now  yields  (9.11)  and  (9.12)  from  the  general  expressions  (9.7) 
and  (9.9). 


10.  Complete  communication  of  erroneous  observations 

In  the  preceding  section  I  considered  the  effects  of  errors  in  instructions  from 
a  “central  decision  agency”  to  the  individual  team  members.  In  this  section  I 
shall  consider  the  effects  of  errors  in  the  information  provided  by  the  team  members 
to  such  a  central  agency. 

For  this  information  structure  I  consider  only  the  case  of  cospecialization  of 
action  and  observation  (ft  =  in)-  Suppose  that  each  team  member  sends  a 
message  consisting  of  the  value  m(x)  plus  an  error  u{x)  to  a  central  agency.  On 
the  basis  of  the  messages  received  from  all  N  team  members,  the  central  agency 
then  computes  the  best  decision  for  each  team  member,  and  communicates  this 
to  him  (error  free).  Note  that  in  this  case  all  N  decisions  are  based  upon  the 
same  information.  To  be  precise  the  information  structure  to  be  discussed  is 

(10.1)  r ]i(x)  =  [in(x)  +  €i(x),  •  •  •  ,  yN(x)  +  eN(x)]f  for  all  i. 

Note  that  this  information  structure  is  formally  equivalent  to  that  of  com¬ 
plete  communication  of  “observations”  in  +  et;  in  particular  the  results  for  this 
structure  follow  directly  from  those  of  section  5,  which  are  repeated  here  for 
convenience.  The  best  team  decision  function  is 

(10.2)  a{y)  =  Q~1E(ji\y)9 

with  a  corresponding  value 

(10.3)  Vo  =  E{E(n\y)'Q~lE(n\y)}  =  Z  ?<y  Cov  [E(m\y),  E(jtj\y)], 

ij 

where  (( qif ))  is  the  inverse  of  the  matrix  ((<?#)). 

Various  special  cases  are  of  interest.  If  the  m  and  €»  are  all  statistically  inde¬ 
pendent,  then 

(10.4) 


E(m\y)  =  E(m\in  +  *»). 
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If  further,  the  m  and  u  are  normally  distributed  (with  means  that  can  be  taken 
to  be  zero),  then 


(10.5)  E(m\ni  +  u)  =  — ^  (Mi  +  *0 

where  and  4  =  £(«<).  In  this  case  the  best  team  decision  function, 

and  corresponding  value,  are,  respectively, 


(10.6) 

(10.7) 


<*«  = 


Fe  = 


_ Su _ \ 

1  +  t2i/su) 


If  one  further  specializes  by  assuming  that 

Ed  =  s2 

(10.8)  Et\  =  i2 


the  value  becomes 
(10.9)  Vt 


Ars2[l  +  (N-  2)<?1 
(l  +  |)  (1  -  <?)[!  +  (N  —  l)g] 


The  reader  can  verify  that  for  t2  =  0  (no  error),  Fe  equals  the  value  of  complete 
information,  whereas  as  ( t2/s 2)  gets  large,  V6  approaches  zero  as  a  limit. 

Proof  of  results.  The  results  (10.2)  and  (10.3)  follow  directly  from  (5.6) 
and  (5.12).  The  special  case  (10.9)  is  similar  to  that  discussed  in  the  previous 
section. 


11.  Management  by  exception:  Reporting  exceptions 

The  term  “management  by  exception”  covers  a  number  of  organizational 
devices  whereby  the  decision  about  a  given  action  variable  is  normally  made  on 
the  basis  of  relatively  few  information  variables,  but  may  be  made  on  the  basis 
of  more  information  if  the  original  information  variables  take  on  “exceptional” 
values.  In  this  and  the  next  section  I  analyze  two  such  “management  by  excep¬ 
tion”  devices.  The  first  might  be  called  “reporting  exceptions,”  or  more  ac¬ 
curately,  if  somewhat  colloquially,  “passing  the  buck.”  The  second  device, 
discussed  in  the  next  section,  can  be  described  as  “emergency  conference.”  The 
comparison  of  the  various  information  structures  considered  in  this  paper,  which 
is  made  in  section  13,  tends  to  confirm  the  widely  held  belief  that  management 
by  exception  can  provide  a  relatively  efficient  way  of  utilizing  information. 

I  shall  analyze  these  particular  management  by  exception  information  struc¬ 
tures  in  the  context  of  cospecialization  of  action  and  observation  (f»  =  #*<). 
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Before  giving  a  precise  definition  of  reports  of  exceptions,  the  following  descrip¬ 
tion  may  be  helpful.  Suppose  that  for  each  team  member  i,  the  range  of  possible 
values  of  m(x)  is  divided  into  two  parts,  “ordinary”  values  and  “exceptional” 
values.  Let  Ri  denote  the  set  of  “exceptional”  values.  If  in  a  particular  instance, 
member  i  observes  m(x)  to  be  not  exceptional,  that  is,  not  in  Riy  then  he  chooses 
a  value  of  his  action  variable  a{  on  the  basis  of  m(x)  only,  according  to  some 
decision  function,  say  y{.  On  the  other  hand,  if  he  observes  m(x)  to  be  excep¬ 
tional,  that  is,  in  Riy  then  he  reports  that  value  to  a  “central  agency.”  The 
central  agency  then  makes  the  decision  about  the  values  of  the  decision  variables 
of  all  team  members  i  who  have  reported  exceptional  observations,  on  the  basis 
of  all  those  exceptional  observations. 

More  precisely,  the  information  structure  to  be  analyzed  in  this  section  is 
defined  as  follows.  For  each  i ,  let  Ri  be  a  given  subset  of  the  real  line  [the  “ex¬ 
ceptional”  values  of  and  for  each  state  of  nature  let  J(x)  be  the  set  of 

all  j  such  that  mix)  £  Rj.  Then  the  information  structure  ??  is  defined  by 


(11.1) 


7}i(x) 


if  ^  Riy 

~{^j{x)} j^j(x)y  if  £:  R^ 


Note  that  (11.1)  defines  a  class  of  information  structures,  a  particular  structure 
being  determined  by  a  particular  choice  of  the  exception  sets  Ri,  •  •  •  ,  Rn- 

Such  an  information  structure  can  of  course  be  described  in  a  somewhat  more 
general  context  than  the  one  used  here.  The  basic  idea  is  that  the  variables 
directly  observed  by  member  i  have  “exceptional”  and  “ordinary”  values;  if 
they  are  ordinary,  he  makes  the  decisions  about  his  action  variables  just  on  the 
basis  of  that  information;  if  they  are  exceptional,  the  decisions  are  made  by  an 
agent  on  the  basis  of  all  the  exceptional  information  (and  possibly  other  infor¬ 
mation  as  well). 

In  what  follows  it  is  assumed  that  the  variables  m(x)  are  statistically  inde¬ 
pendent,  with  means  zero  and  variances  si  It  is  also  assumed  that  each  m{x) 
has  a  distribution  that  is  symmetrical  about  its  mean,  zero.  Likewise,  we  only 
consider  exceptional  sets  Ri  that  are  symmetrical  around  zero,  that  is,  if  m  is 
in  Ri  then  so  is  —m. 

It  will  be  seen  that  in  this  case  the  following  parameters  are  of  central  im¬ 
portance  in  evaluating  the  information  structure  corresponding  to  a  particular 
choice  of  Ri,  •  •  ,  Rn, 

Pi  =  P\jn(x)  G  Ri\, 

sli  =  Var  \jn(x)\fjLi(x)  £  Ri]. 


Thus  Pi  is  the  frequency  with  which  the  variable  m(x)  turns  out  to  be  exceptional ; 
and  s%{  is  the  conditional  variance  of  m(x),  given  that  it  is  exceptional.  The 
larger  piy  the  more  frequently  the  action  variable  a*  is  determined  by  the  central 
agent,  and  the  larger  will  be  the  (gross)  expected  payoff.  Of  course,  the  greater 
the  frequency  of  exceptions,  the  more  costly  one  can  expect  such  an  information 
structure  to  be. 
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Value  of  information  for  groups  of  equal  size.  N  =  100. 


It  will  also  appear  that,  other  things  being  equal,  the  larger  the  conditional 
variances  sm,  the  larger  the  gross  expected  payoff.  This  is  not  implausible,  in 
view  of  the  quadratic  payoff  function.  The  precise  result  is  this:  given  the  prob¬ 
abilities  pi,  •  -  -  ,Pn,  the  optimal  choice  of  Ri  is  that  which  maximizes  s|i,  and 
this  is  achieved  by  taking  Ri  to  be  the  complement  of  an  interval  symmetric 
around  zero.  Note  that  in  this  case  the  values  in  Ri  are  indeed  “exceptional”  in 
the  usual  sense  of  being  farther  from  the  average  than  the  “ordinary”  values. 

Before  deriving  the  formulas  for  the  best  decision  rules  and  the  value  of  this 
type  of  information  structure,  I  present  the  results  of  some  numerical  computa¬ 
tions.  For  the  purposes  of  these  computations,  it  was  assumed  that 
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Figure  2 

Reports  of  exceptions: 

V  as  a  function  of  q  for  N  =  10  and  p  =  .1,  .2. 


(11.3) 


fl>  i  =  h 
W,  i  ^  h 


(11.4)  m(x)  is  normally  distributed,  with  mean  0  and  variance  1,  for  each  i. 


(This  is  the  special  case  of  “identical  interaction”  that  has  been  discussed  in 
several  previous  sections.) 

Taking  all  the  exception  sets  R *  to  be  identical,  and  choosing  them  in  the  best 
way  (subject  to  the  constraint  of  symmetry),  the  values  of  the  information 
structures  were  calculated  for  various  values  of  the  parameters:  q ,  degree  of 
interaction;  N ,  number  of  action  variables;  p,  probability  of  a  value  of  m(x) 
being  exceptional.  The  parameters  q  and  N  are  to  be  thought  of  as  “technolog- 
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Reports  of  exceptions: 

V  as  a  function  of  q  for  N  =  100  and  p  —  .1,  .2. 


ical,”  whereas  p  is  a  parameter  of  the  information  structure,  to  be  chosen  by 
the  organizer. 

It  should  be  noted  that  the  parameters  sRi  [see  (11.2)]  are  all  equal,  because 
the  sets  Ri  are  identical;  furthermore,  their  common  value  is  determined  by  p, 
once  the  distribution  of  m(x)  is  given,  and  the  best  choice  of  Ri  is  made.  It 
might  also  be  noted  that  the  effect  of  assuming  the  variances  of  the  m(x)  to  be, 
say,  s 2  instead  of  1,  would  be  to  multiply  all  the  computed  values  by  s 2. 

First  consider  the  effect  of  changing  the  interaction  parameter  g.  Figures  2 
and  3  show  the  value  V  of  the  information  structure,  as  a  function  of  q ,  for  dif¬ 
ferent  pairs  of  values  of  p  and  N.  As  the  figures  illustrate,  the  value  is  greater, 
the  larger  g,  rising  slowly  when  q  is  near  zero,  and  then  more  rapidly  as  q  ap¬ 
proaches  1.  Note,  too,  that  the  increment  in  value  due  to  going  from  p  =  .1  to 
p  =  .2  is  larger,  the  larger  q. 
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Reports  of  exceptions: 

V  as  a  function  of  p  for  N  —  100  and  q  —  .5. 

Figure  4  shows  the  effect  of  changing  p,  the  relative  frequency  of  exceptions, 
for  fixed  values  of  q  and  N.  As  one  would  expect,  the  value  V  increases  with  p; 
however,  each  successive  increment  of  p  produces  a  smaller  increment  of  value, 
so  that  p  has  “decreasing  marginal  value.”  This  latter  effect  is  quite  marked, 
in  this  example  at  least,  so  that  a  frequency  of  exceptions  of  1/3  has  achieved 
almost  80  per  cent  of  the  possible  increase  in  value. 
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Reports  of  exceptions  and  complete  information : 

V/N  as  a  function  of  N  for  q  —  .5  and  p  —  .1. 

Turning  to  figure  5,  which  shows  the  effect  of  changing  N,  one  sees  that  as  N 
increases,  with  p  and  q  fixed,  the  value  V  increases  more  than  proportionately. 
This  is  illustrated  in  the  figure  by  plotting  {V/N)  as  a  function  of  N  (the  lower 
curve).  Recall  that  under  these  particular  assumptions  the  value  of  complete 
information  also  increases  more  than  proportionately  with  N  [see  equations 
(5.22)  and  (5.23)];  this  is  shown  by  the  upper  curve  in  figure  5.  As  inspection 
of  the  two  curves  shows,  (V/N)  approaches  a  constant  much  more  rapidly  for 
complete  information  than  for  reports  of  exceptions. 

As  N  increases,  the  expected  number  of  exceptions  Np  increases  proportion¬ 
ately.  If  the  costs  of  dealing  with  these  exceptions  were  proportional  to  the 
average  number  of  exceptions,  then  we  would  have  here  an  example  of  increasing 
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Reports  of  exceptions: 

V/N  as  a  function  of  N  for  q  =  .5  and  Np  -  1,  10. 

returns  to  scale  in  the  size  of  the  organization  arising  from  the  use  of  this  type 
of  information  structure. 

Finally,  figure  6  shows  the  effect  of  increasing  N  while  simultaneously  decreas¬ 
ing  p  so  that  the  expected  number  of  exceptions  Np  remains  constant.  These 
curves  show  that  although  in  this  case  total  value  increases  with  N,  it  does  so 
less  than  proportionately  to  N.  As  N  increases  without  limit,  the  ratio  V/N 
decreases  to  the  limiting  value  1,  which  is  the  value  of  V/N  for  complete  de¬ 
centralization  in  this  case. 

Derivation  of  best  decision  functions.  Assume  that 

(i)  the  distribution  of  each  pi  is  symmetrical  around  its  mean,  which  can  be 
taken  to  be  zero. 

(ii)  Each  exception  set  Ri  is  symmetric  around  zero. 

(iii)  m,  •  •  •  ,  pN  are  statistically  independent. 

Recall  that  J(x )  denotes,  for  each  x,  the  set  of  indices  of  those  variables  pj(x) 
that  have  exceptional  values,  that  is, 

(11.5)  J(x)  m  {j\pj(x)  e  Rj}. 

Denote  by  pJ{x)  the  vector  of  those  coordinates  of  p  for  which  j  is  in  J(x);  and 
denote  by  Qj(X >  the  matrix  of  those  elements  qa  of  Q  for  which  i  and  j  are  in 
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J(x).  In  this  notation,  the  information  structure  to  be  analyzed  can  be  described 
by 


(11.6) 


Vi(x)  = 


/».(*),  if 

HJ(x),  if  i^J{x). 

Consider  now  the  particular  team  decision  function  Si  defined  by 


(11.7) 


qu 

~[Qj(x)VJ~\i(x) , 


if  i@J(x), 
if  i  E  J(x). 


In  other  words,  the  decision  function  &  just  defined  directs  team  member  i 

(i)  to  take  that  action  that  would  be  appropriate  under  “complete  decentral¬ 
ization/ J  if  he  observes  an  unexceptional  value  of  /*», 

(ii)  to  take  that  action  that  would  be  appropriate  under  “partitions  with 
independent  information,”  with  i  in  the  group  J(x),  if  he  observes  an  exceptional 
value  of  Mt* 

I  shall  now  show  that  Si  satisfies  (3.1)  and  is  therefore  optimal. 

First  note  that 


(11.8) 


0, 

<0, 

WVL, 


if  i  E  J( x), 

if  j  E  J(x)> 

if  f  E  T(x),  j  E  J(x). 


This  follows  from  the  independence  and  symmetry  of  the  m  distribution  and  the 
symmetry  of  the  sets  /?,.  Therefore,  if  i  E  J (x) 

(H-9)  £  =  ff«(^)  = 


and  if  i  E  </(x), 

(li-io)  23  qijE[&j\i)i]  =  53  goCQJ  VL  =  Mi- 

3  jE:J 


Equations  (11.9)  and  (11.10)  together  verify  that  Si  satisfies  (3.1)  and  is  there¬ 
fore  optimal. 

Computation  of  the  value  of  the  information  structure.  According  to  (3.4)  the 
expected  payoff  yielded  by  the  best  team  decision  function  Si  is 


(11.11) 


V 7  =  E  E  Hjdj. 

3 


I  shall  now  show  that  V7  is  given  by  equation  (11.23)  below.  Given  any  particular 
set  K 


(11.12)  E{V\J(x)  =  K}  =  E{  £  vA  +  E  =  K} 

j^K  i<£K 

=  E{(»kYQk'hk \n{x)  E  Rj  for  j  E  K} 

+  23  f  e{£ m*)  e  e.} 

i&Kqu 
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sit  =  E{fi2i\m(x)  Hi } 

1  s2m  s  E{^|m;(*)  G  £;} 

(11.14)  =  Ah  diagonal  element  of  Q*1. 

Then  (recalling  that  the  juy  are  independent) 

(11.15)  J£{(M*)'QiVk  G  By  for  JGK}=  EffM/ 

JGK 

and  (11.12)  can  be  rewritten, 

(11.16)  E{V\J(x)  =  K)  =  E  ?I4+  Z  GW 

Denote  by  p>  the  probability  that  m>(x)  is  exceptional,  that  is, 

(11.17)  Vi  =  P{N(x)  G  B,}. 

Then  the  probability,  for  a  given  set  K ,  that  J ( x )  =  A"  is 

(11.18)  P(A)  =  n  Vi  n  (1  ~  Py), 

ye*  >€* 

and  taking  the  expected  value  of  (11.16)  gives 

(11.19)  v^  =  E  P(K)  [  Z  +  Z  (f )  s«l- 

all  K  L;GA  JeK\qjj /  J 

This  last  can  be  put  into  a  more  useful  form  if  one  interchanges  the  order  of 
summation  over  the  sets  K  and  the  index  j  of  the  team  members,  thus, 

(11.20)  Ft  =  Z  I"  Z  P(K)qUj  +  Z  P(K)  (f)  4,1 

j  =  1  LA3j  K&j  \<ln/  J 

=  Z  U,  z  P(K)qU  +  (pjslj  E  P(K) } 

i- 1  L  *3/  \9«/  K2>j  J 

First  note  that 

(11.21)  5  TO  =  (1  ~  Pi). 

K%3 

Second,  one  can  write 

(11.22)  E 

J3j 

where  by  convention  one  takes  q JJ  =  0  if  j  J.  Substituting  these  last  two  equa¬ 
tions  in  (11.20)  gives 

(11.23)  V7  =  £  [slMqU)  +  4X1  -  Vi)  (^)} 

This  is  the  formula  I  shall  use  in  the  further  analysis  of  the  value  of  this  infor¬ 
mation  structure. 

One  can  now  show'  that,  given  p,,  the  best  set  Rj  is  the  complement  of  an 
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interval  (symmetric  around  zero,  by  assumption).  First  by  the  symmetry  as¬ 
sumptions,  the  variance  of  /xy  is  related  to  the  conditional  variances  s%j  and  soj  by 

(11.24)  s§  =  pjsl)  +  (1  -  pj)s20J. 

Therefore,  choosing  the  sets  R,  to  maximize  F7  for  given  probabilities  pj  is 
equivalent  to  choosing  the  conditional  variances  s%j  and  s%j  to  maximize  V7, 
subject  to  (11.24)  and  s%jy  Soj  ^  0.  This  can  be  done  by  making  s%  as  large  as 
possible  if 

(11.25)  E{&x)\J{x)  Bj}  ^  ~ 

Hi) 

Now  note  that,  since  the  matrix  Q  is  positive  definite  (and  hence  so  is  every  Qj) 

(11.26)  ^  1,  all  j  G  J, 


with  strict  inequality  unless  J  =  {j}  or  qjk  =  0  for  all  k  ^  j.  Condition  (11.25) 
is  therefore  always  satisfied. 

Consider  now  a  special  case.  Suppose  that  all  the  variances  $<,  s%iy  and  s%i  are 
the  same,  and  equal  to  s2,  s|,  and  So,  respectively,  and  suppose  that  all  the  sets 
Ri  are  identical,  with  pi  =  p.  Let  M (x)  denote  the  number  of  elements  in  J(x) ; 
then  M(x)  has  the  binomial  distribution  B(p,  N ).  Define  f*(M)  and  g(M)  by 


(11.27) 


f*(M)  M(x)  =  M}, 

3  EJ  Or) 

9(M)=e\  £  (f)|MW=4 
ly^y(x)  \q»/  J 


Equation  (11.23)  for  V7  now  reduces  to 

(11.28)  F7  =  s’  [(^)  Ef*(M[x])  +  (^)  Eg(M[x])~\. 


In  particular,  in  the  case  of  “identical  interaction” 


(11.29) 


Qij 


-f 

W, 


if  i  =  j, 
if  i  t*  j, 


t  follows  from  (5.14)  and  (5.2)  that 

r(M)  . ^ 


(11.30) 


(1  -  q)[  1  +  (M 
g(M )  =  N  -  M, 


1)?] 


so  that  W  is  given  by 

(U.3.)  V,-  *.[(4)  MMp]  +  (1  -P)(|)} 

This  is  the  formula  used  in  the  computation  of  the  numerical  results  described 
earlier  in  this  section,  with  s2  =  1  and  the  m  normally  distributed.  There  seems 
to  be  no  convenient  closed  expression  for  Ef[M(x),  g]. 

Under  the  assumption  of  normality,  with  s2  =  1,  one  has  the  following  rela- 


520  FOURTH  BERKELEY  SYMPOSIUM:  RADNER 

tionship  between  p,  s%,  and  the  interval  [ — r,  r]  that  defines  the  complement  of  R : 

(11.32)  p  =  2  j*  <p{t)  dt, 

(11.33) 


.2  _  2ry(r) 

*  -  p  + 


where  <p(t)  =  1/V^irexp  {—t2/ 2}.  Formula  (11.33)  is  derived  easily  using  inte¬ 
gration  by  parts.  From  (11.24)  of  course,  one  has 

(11.34)  p«l  +  (1  -P)4  =  1. 

Value  of  information  for  large  N.  In  the  special  case  covered  by  (11.31), 
(1  /N)f(M,  q )  can  be  written 


(11.35) 


fSMdH 

N 


(f) 

‘  /AT  -  2\  1 

.,+(  «  k 

(1-9] 

{‘+1 

(M  —  1\  "1 
<  N  )q\ 

lim  ,  J_. 

N  1  —  q 


N- 


Hence,  by  the  law  of  large  numbers 

(11.36) 

Together  with  (11.31),  this  last  implies 

(U.37)  lim  (?)  +  <!-*»  (?)) 


12.  Management  by  exception:  “Emergency  conference” 


In  the  last  section  it  was  assumed  that  the  decisions  about  only  those  variables 
corresponding  to  “exceptional”  information  were  taken  jointly,  whereas  the 
decisions  about  the  other  variables  were  taken  independently.  Another  manage- 
ment-by-exception  type  of  information  structure,  which  might  be  labeled  “emer¬ 
gency  conference,”  stipulates  that  whenever  any  information  variable  takes  on 
an  exceptional  value,  all  decisions  are  taken  jointly.  More  precisely,  I  will 
analyze  the  following  information  structure : 


(12.1) 


if  for  every  j ,  ^(x)  Rj, 

if  for  some  j ,  £  Rj , 


where  Riy  •  •  •  ,  Rn  are  given  subsets  of  the  real  line. 

Let  R  be  the  set  of  states  of  nature  x  for  which  at  least  one  of  the  values  yj(x) 
is  exceptional,  that  is, 

(12.2)  R  =  {x\  for  some  jf  nj(x)  £  Rj}. 

It  is  clear  that  when  the  state  of  nature  x  is  in  R,  then  the  team  is  in  a  situation 
of  complete  information,  whereas  when  x  is  not  in  R ,  then  the  team  is  in  a  situa¬ 
tion  of  complete  decentralization,  facing  a  conditional  distribution  j*,  given  that  x 
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is  not  in  R.  If  /*i,  •  •  •  ,  w  are  independent,  as  I  shall  assume  in  this  section,  then 
they  will  also  be  conditionally  independent,  given  that  x  ^  it. 

As  before,  it  turns  out  that  the  important  parameters  of  the  exception  sets 


Ri  are 

(12.3) 

Pi  =  P{m(x)  £  ii!,-}, 

(12.4) 

s2,  =  Var  (M<), 

(12.5) 

sit  =  Var  [pi\iu(x)  Q  Ri]. 

Indeed  I  will  show  that  (assuming,  as  we  can  without  loss  of  generality,  that 
Ey,  =  0)  the  value  of  the  information  structure  (12.1)  is 

(12.6)  F8  =  £  9«8?  -  [Z  (<r  -  A]  P{ x  $  R), 

where  ((qij))  =  Q-1,  and 

(12.7)  p{x  &  r}  =  n  a  - 

i 

It  will  also  be  shown  that,  given  piy  •  •  ,  pN,  the  best  choices  of  the  sets  Ri  are 
the  complements  of  intervals. 


Figure  7 

Emergency  conference: 

V  as  a  function  of  p  for  q  —  .5  and  N  —  100. 
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In  particular  if 


Pi  =  p 

Si  =  s, 

2  2 
Soi  —  So 

f1, 

i  =  j 

Qn  =  I 

i  ^  j, 

then  expression  (12.6)  reduces  to 

(12.9)  Vs  =  s'-mq)  -  «g(  1  -  v)NU(N)q)  -  N], 

where  f(N,q)  is  given  by  (5.23). 

Figure  7  shows  the  value  of  emergency  conference  as  a  function  of  p,  for 
q  =  .5  and  N  =  100,  as  given  by  (12.9).  Note  that  the  value  rises  extremely 
rapidly  for  small  values  of  p,  so  that  by  the  time  p  has  reached  .05,  the  increase 
in  value  over  p  =  0  is  97  per  cent  of  the  total  possible  increase  (p  =  1).  This  is 
to  be  expected  when  N  is  fairly  large,  since  it  takes  only  one  exception  to  con¬ 
vene  the  entire  “conference,”  and  bring  about  a  state  of  complete  information. 
The  probability  of  one  or  more  exceptions  occurring  is  1  —  (1  —  p)  v. 

Figure  8  shows  V/N  as  a  function  of  N  (with  N  varying  from  1  to  100),  for 
q  =  .5,  and  p  =  .01  and  .1.  As  N  increases,  for  fixed  (positive)  p,  the  probability 


Figure  8 

Emergency  conference: 

V/N  as  a  function  of  N  for  q  =  .5  and  p  —  .01,  .1 
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Emergency  conference: 

V/N  as  a  function  of  N  for  q  =  .5  and  (1  —  p)N  =  .99). 

of  a  “conference,”  that  is,  of  at  least  one  exception  occurring,  converges  rapidly 
to  one.  With  p  =  .1,  by  the  time  N  has  reached  40  one  is  practically  in  a  situa¬ 
tion  of  complete  information. 

The  last  remarks  suggest  looking  at  how  V/N  behaves  as  a  function  of  N, 
when  the  probability  of  a  “conference”  is  kept  constant.  Figures  9  and  10  show 
two  such  curves  (for  q  =  .5),  the  first  with  1  —  (1  —  p)N  held  constant  at  .01, 
and  the  second  with  1  —  (1  —  p)N  held  constant  at  .10.  These  figures  reveal 
that  for  any  given  value  of  the  probability  of  a  “conference,”  there  is  a  value  of  N 
that  maximizes  V/N.  In  other  words,  with  the  probability  of  a  conference  fixed, 
there  are  decreasing  returns  to  scale  after  some  point.  This  is  in  contrast  with  the 
corresponding  case  for  “reporting  exceptions,”  as  exemplified  in  figure  5  of  the 
previous  section.  Note  that  the  decreasing  returns  to  scale  in  the  present  case 
occurs  even  though  the  average  size  of  the  conference  i\T[l  —  (1  —  p)N]  is  in¬ 
creasing.  If  the  average  size  of  the  conference  were  to  be  held  constant,  the 
tendency  towards  decreasing  returns  to  scale  would  be  more  marked.  This  last 
situation  is  the  one  that  is  comparable  to  figure  6  of  the  previous  section. 

Best  team  decision  functions.  Consider  now  the  information  structure  of 
(12.1),  with  arbitrary  sets  Ri,  •  *  *  ,  Rn ,  and  assume  that 
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Emergency  conference: 

V/N  as  a  function  of  N  for  q  =  .5  and  (1  —  p)N  —  .90. 


(12.10)  m,  •  •  •  ,  hn  independent, 

(12.11)  E(ju)  =  0,  Var  (Mt)  =  si 
Define  m°  and  a0  by 

(12.12)  m°t  =  ^0*,|M4(*)  $  Ri}. 

(12.13)  a0  =  Q-'m°. 


By  applying  theorem  1  it  can  be  shown  that  the  best  team  decision  function  <S 
is  given  by 


(12.14) 


o «  + 


m° 


so-^m 


if  x  R] 

if  x  S  R- 


The  proof  is  routine,  and  is  omitted. 

Value  of  the  information  structure.  Again  we  consider  the  two  cases  x  R 
and  x  S  R  separately,  by  writing  the  value  of  information  as 

(12.15)  Vg  =  E{u[x,  <$(«/)]} 

=  E{u>\x  &  R}P\(x  g  ft)  +  E{o> \x  £  R}P(x  e  R). 
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Because  &  satisfies  condition  (3.1)  of  theorem  1  in  each  case  (x  R,  x  £  R) 
separately,  one  can  apply  equation  (3.4)  to  each  case.  After  some  calculation, 
this  application  yields  equation  (12.6). 

Note  that  since  Q  is  positive  definite,  qH  ^  1/qu,  so  that  the  term  in  brackets 
on  the  right  side  of  (12.6)  is  nonnegative.  The  quantity  £  q{is\  is  the  value  of 
complete  information  under  the  current  assumptions. 

In  the  special  case  described  by  (12.8)  it  follows  easily  from  (5.4)  that  the 
value  Vs  reduces  to  the  expression  given  in  (12.9). 

Best  choice  of  the  exception  sets .  Given  the  probability  of  a  conference, 
P{x  (E  R}  =  1  —  P{x  R},  the  choice  of  the  sets  Rif  •  •  •  ,  RN  that  maxi¬ 
mizes  the  value  (12.6)  is  the  choice  that  minimizes 

(12.16)  Wgii  -i-\  ^ 

i  \  qu/ 

subject  to 


(12.17)  n  (1  -  Pi)  =  P{x  $  R} 

i 


(the  pi  and  the  sh  being  of  course  related).  In  particular  given  the  values  of 
Pi,  *  *  •  ,  Pn ,  the  expression  (12.16)  is  minimized  by  taking  each  set  Ri  to  be  the 
complement  of  some  interval ,  symmetric  around  zero  ( the  mean  of  m).  This  char¬ 
acteristic  is  therefore  true  of  the  best  choice,  given  only  the  value  of  P{x  R}. 

In  the  case  of  symmetric  sets  Ri,  one  has  a0  =  m°  =  0,  so  that  the  best  team 
decision  function  is  given  by 


(12.18) 


>»(s) 

Qu  ’ 


if  x  R; 
if  x  £  R. 


13.  Comparisons  anong  the  several  information  structures 

In  this  section  I  shall  present  comparisons  among  the  several  information 
structures  that  have  been  considered  in  the  previous  sections.  These  comparisons 
will  be  made  for  the  special  case  of  cospecialization  of  action  and  observation, 
with  identical  interactions,  and  independent  observations  with  identical  vari¬ 
ances. 

The  first  set  of  comparisons  is  among  the  four  information  structures  (1) 
partition  into  equal  groups;  (2)  partition  into  groups  with  only  one  group  having 
more  than  one  member;  (3)  emergency  conference;  and  (4)  reports  of  exceptions. 
As  will  be  seen,  these  four  structures  are  comparable  in  the  sense  that  structures 
(3)  and  (4)  can  be  viewed  as  resulting  from  variable  partitioning  into  groups, 
the  particular  partition  used  depending  upon  the  information  signals  that  are 
actually  received  by  the  team  members.  It  will  be  seen  that  if  one  compares 
structures  of  the  above  four  types  with  the  same  average  group  size,  then  the 
above  list  is  in  the  order  of  increasing  value. 

This  result  can  be  explained  heuristically  as  follows.  Under  the  assumptions 
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described  above,  the  “technology”  of  the  team  exhibits  increasing  returns  to 
scale,  that  is,  under  complete  information,  value  per  person,  V/N,  increases 
as  N  increases  (see  the  end  of  section  5).  With  independent  observations, 
partition  of  the  team  is  equivalent  to  substituting  for  the  original  team  a  collec¬ 
tion  of  smaller  teams,  with  the  same  total  number  of  members.  Because  of  the 
increasing  returns  to  scale,  if  the  number  of  groups  is  given,  the  best  allocation 
of  the  members  to  the  groups  is  achieved  by  assigning  as  many  members  as 
possible  to  one  group,  leaving  the  rest  of  the  groups  with  one  member  each. 
This  accounts  for  the  greater  value  of  (2)  as  against  (1)  in  the  above  comparison. 

The  superiority  of  (3)  and  (4)  over  (1)  and  (2)  is  plausible  when  one  sees  that 
structures  (3)  and  (4)  have  something  of  the  character  of  a  two-stage  sequential 
analysis.  Additional  information  is  brought  to  bear  on  decisions  only  under 
circumstances  in  which  additional  information  is  more  than  ordinarily  helpful. 
In  this  respect  “reports  of  exceptions”  is  more  selective  than  “emergency  con¬ 
ference,”  since  it  brings  the  additional  information  to  bear  upon  only  those 
action  variables  that  are  associated  with  the  unusual  observations,  rather  than 
upon  all  the  action  variables.  Indeed,  it  will  be  shown  that  for  large  values  of  A, 
“emergency  conference”  is  approximately  no  better  than  fixed  groups  with  only 
one  group  having  more  than  one  member. 

The  second  comparison  is  between  error  in  instruction  (section  9)  and  com¬ 
plete  communication  of  erroneous  observation  (section  10).  It  will  be  seen  that 
if  one  compares  structures  of  the  two  types  that  have  the  same  ratio  of  variance 
of  error  to  variance  of  message,  then  the  error  in  observation  type  is  preferable 
to  the  error  in  instruction.  This  is  related  to  the  fact  that  under  complete  in¬ 
formation,  with  nonzero  interaction,  the  optimal  decision  rules  for  the  several 
members  are  correlated  (section  5).  In  the  case  of  error  in  observation,  the  com¬ 
plete,  error-free  communication  makes  possible  any  desired  degree  of  correla¬ 
tion  between  the  decisions  of  different  team  members;  whereas  the  error  in 
instruction  introduces  a  lack  of  correlation  between  the  information  on  which 
different  decisions  must  be  based. 

General  remarks  on  comparisons  of  information  structures .  Before  going  into 
the  detailed  comparisons  of  this  section,  some  general  remarks  may  be  helpful. 
Ideally,  one  would  want  to  compare  information  structures  on  the  basis  of  net 
value  of  information,  namely  gross  value  of  information  minus  the  cost  of  both 
the  information  and  the  associated  best  decision  function.  Therefore,  any  com¬ 
parison  between  the  gross  values  of  two  information  structures  is  meaningful 
only  in  the  context  of  some  assumption  about  the  relative  costs  of  the  two 
structures.  Although  no  explicit  discussion  of  costs  is  presented  here,  certain 
assumptions  are  implicit  in  the  comparisions  made  below.  Thus,  in  the  com¬ 
parisons  among  information  structures  based  upon  fixed  or  variable  partitions 
into  groups,  the  implicit  assumption  is  that  costs  depend  upon  average  group 
size.  On  the  other  hand  the  comparison  between  error  in  instruction  and  error 
in  observation  is  meaningful  if  the  costs  depend  upon  the  ratio  of  the  variance 
of  the  error  to  the  variance  of  the  message. 
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Fixed  and  variable  'partitions.  Consider  now  the  case  of  cospecialization  of 
action  and  observation,  equation  (4.3),  together  with  the  special  assumptions 
of  identical  interactions  [equations  (4.4)  and  (4.5)],  and  independent,  normally 
distributed  observations  Hi  with  identical  variances.  There  is  no  further  loss  of 
generality  in  assuming  that  the  Hi  all  have  means  zero  and  variances  one. 

The  two  fixed-partition  information  structures  to  be  considered  are  (1)  parti¬ 
tions  into  equal  groups,  and  (2)  partitions  such  that  at  most  one  group  has  more 
than  one  member.  Under  the  above  assumptions  the  values  for  these  two  types 
of  structure  are  given,  respectively,  by 


(13.1)  V3  =  (J^f(M,q), 

(13.2)  Vi  =  f(M,  q)  +  {N  -  M), 

see  (7.10)  and  (7.11);  where  in  the  first  case  M  denotes  the  number  of  persons 
in  each  group,  and  in  the  second  case  M  denotes  the  number  of  persons  in  the 
one  group  that  can  possibly  have  more  than  one  member;  and  where /(n,  q),  as 
in  (5.23)  is  defined  by 


(13.3) 


f(n,  q) 


n[  1  +  (n  -  2)q] 

[1  -q][  l  +  (n-  1)?] 


The  two  variable  partition  information  structures  to  be  considered  are  “emer¬ 
gency  conference”  (section  12)  and  “reports  of  exceptions”  (section  11),  with 
values  given,  respectively,  by 

(13.4)  Vs  =  f(N,  q )  -  *8(1  -  p)Nlf(N,  q)  -  N], 

(13.5)  V7  =  AEf(M ,  q)  +  N(  1  -  p)«8, 

see  (12.9)  and  (11.31) ;  where  p  is  the  probability  that  a  value  of  an  observation 
Hi(x)  is  exceptional,  so  is  the  conditional  variance  of  Hi  given  that  it  is  not  excep¬ 
tional,  sr  is  the  conditional  variance  Hi  given  that  it  is  exceptional,  and  in  (13.5) 
M  has  the  binomial  distribution  B(p}  N).  Recall  that,  as  in  (11.34), 

(13.6)  psl  +  (1  —  p)*o  =  1, 

and  that  s|  and  si  are  determined  by  p  [see  (11.32)  and  (11.33)]. 

To  compare  the  values  of  the  above  four  types  of  information  structure,  I  will 
compare  structures  that,  roughly  speaking,  have  the  same  average  group  size. 
It  will  be  more  convenient,  however,  to  consider  explicitly,  for  any  fixed  N ,  the 
average  number  of  groups  associated  with  the  information  structure.  Thus,  for 
the  case  of  partitions  into  equal  groups  of  size  M,  the  number  of  groups  is 


(13.7) 


whereas  for  the  case  of  one  large  group,  of  size  M,  the  number  of  groups  is 

(13.8)  G  =  N  -  M  +  1. 
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Comparisons  among  fixed  and  variable  partitions: 

Fas  a  function  of  fixed  or  average  number  of  groups,  G. 

Curve  1 :  Fixed  groups,  equal  size. 

Curve  2:  Fixed  groups,  one  of  size  M. 

Curve  3:  Emergency  conference. 

Curve  4:  Reports  of  exceptions. 

For  the  two  variable  partition  cases,  the  number  of  groups  is  a  random  variable. 
For  “emergency  conference”  the  expected  number  of  groups  is 

(13.9)  EG  =  N(  1  -p)N  +  1  -  (1  -  p)N ; 
for  “reports  of  exceptions”  the  expected  number  of  groups  is 

(13.10)  EG  =  N(1  —  p)  +  1  —  (1  —  p)N. 

Figure  11  shows  value  V  as  a  function  of  G  (or  EG)  for  the  above  four  types 
of  information  structure,  with  N  =  100  and  q  =  .5.  In  the  fixed  partition  cases, 
G  is  varied  by  varying  M;  in  the  variable  partition  cases  EG  is  varied  by 
varying  p.  As  the  figure  shows,  “reports  of  exceptions”  gives  the  highest  value 
(for  G  different  from  1  or  A),  “emergency  conference”  gives  a  barely  higher 
value  (not  noticeable  on  the  plot)  than  fixed  groups  with  one  of  size  M,  and 
these  two  in  turn  give  a  higher  value  than  fixed  equal  groups. 

The  relations  among  the  above  four  types  of  information  structure  emerge 
quite  clearly  and  simply  for  large  values  of  N.  Suppose  that  as  N  increases  with- 
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out  limit,  the  (average)  number  of  groups  increases  proportionately,  so  that 
G  =  y N  (or  EG  =  yN).  It  is  easily  verified,  using  (13.1)  to  (13.10),  that  the 
limits,  as  N  increases  without  limit,  of  value  per  person,  V/N ,  for  the  four  types 
of  information  structure  are 


(13.11) 

Hmg-  ‘"W 

N  1  —  q 

(13.12) 

limjy  =lim^8 

(13.13) 
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It  is  clear  that  the  above  three  limiting  values  are  in  order  of  increasing  magni¬ 
tude,  except  when  q  =  0,  y  =  0,  or  7  =  1,  in  which  cases  all  three  limiting  values 
are  equal. 

Communication  errors .  Consider  now  the  two  information  structures,  “error 
in  instruction”  (section  9)  and  “error  in  observation”  (section  10).  The  discus¬ 
sion  will  proceed  under  the  same  special  assumptions  of  (1)  cospecialization, 
(2)  identical  interactions,  and  (3)  independent  and  normally  distributed  observa¬ 
tions  m  with  identical  variances.  However,  in  this  case  the  variance  of  /*»  will 
be  denoted  by  s2. 

The  two  information  structures  of  sections  9  and  10  are  comparable  in  that 
they  are  both  concerned  with  complete  communication  in  which  errors  are  in¬ 
troduced.  In  the  one  case,  however,  the  errors  are  introduced  at  the  points  at 
which  the  processed  observations,  that  is,  the  “instructions,”  are  being  com¬ 
municated  from  the  “central  agent”  to  the  team  members,  whereas  in  the  second 
case  the  errors  are  introduced  before  the  processing  of  information,  that  is,  in 
the  communications  of  the  observations  to  the  “central  agent.”  For  the  purpose 
of  the  comparison  to  be  made  here,  let  t 2  denote  the  common  variance  of  the 
several  errors,  which  will  be  assumed  to  be  independent,  normally  distributed 
variables,  with  zero  means,  and  uncorrelated  with  the  original  messages  (that 
is,  instructions  or  observations)  to  which  they  have  been  added.  It  seems  natural 
to  compare  information  structures  of  the  two  types  that  have  the  same  ratio  of 
variance  of  error  to  variance  of  the  original  message. 

For  the  special  case  being  considered  the  value  of  the  “error  in  instruction” 
information  structure  is 


(13.14) 

where 

(13.15) 


and  =  Q~1h  is  the  team  decision  function  that  would  be  best  for  complete 
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information.  Equation  (13.14)  is  another  version  of  equation  (9.12)  but  without 
the  normalizing  assumption  that  w  =  1.  Using  (5.14)  and  (5.15)  one  can,  after 
some  computation,  arrive  at 


(13.16) 

(13.17) 


w  =  s2 


(  (N  ~  2)<?2  +  2<y  \ 

\(N  -  1  )52  +  [1  +(N-  2 )q]*J 

({N  -  1)(?2  +  (1  +  (N  —  2k?\ 
V  [1  -  <zP[l  +  (N-  1)<?]2  ) 


From  (13.15)  to  (13.17)  it  follows  that 


(13.18) 
where 

(13.19) 


lim  m , 

iv-«  \N )  1  +  r  -  q 

=  -• 
w 


Note  that  r  is  the  ratio  of  the  variance  of  the  error  to  the  variance  of  signal  to 
which  the  error  is  added. 

The  value  of  the  “error  in  observation”  information  structure  is,  from  (10.9), 


(13.20) 

(13.21) 
where 

(13.22) 


N*  n  +  {N-  M 

[i  +  (§)]  [1  -  ff][l  +  (N-  l)g] 
{¥)  =  (1  +  rO(l  -  q) 


By  comparing  (13.18)  and  (13.21)  one  sees  that  if  r  =  r',  then 

<13-23>  iT.  (w  )  s  IS,  (w  } 


with  strict  inequality  if  r  and  q  are  strictly  positive. 
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